PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG016743t1
Common NameTCM_016743
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family Trihelix
Protein Properties Length: 285aa    MW: 33619.5 Da    PI: 7.4328
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG016743t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix77.22.5e-2424105186
          trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                       +W  +e+++Li +r e+e+++ ++k++k+lWe vs++mr+rg+ r+p qCk+kw+nl +ryk  ++++ ++     +++p+f++l+
  Thecc1EG016743t1  24 QWGPEETRELILIRGELERDFTAAKRNKTLWEIVSARMRDRGYIRTPDQCKCKWKNLLNRYKGKETSDPEN----GRQFPFFEELH 105
                       7**************************************************************99999974....668******98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500907.6192381IPR017877Myb-like domain
Gene3DG3DSA:1.10.10.603.1E-42383IPR009057Homeodomain-like
CDDcd122034.18E-222488No hitNo description
PfamPF138374.0E-2224106No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 285 aa     Download sequence    Send to blast
MFGGGDSEGV GGRISSMLGG GGGQWGPEET RELILIRGEL ERDFTAAKRN KTLWEIVSAR  60
MRDRGYIRTP DQCKCKWKNL LNRYKGKETS DPENGRQFPF FEELHAVFTE RAKNMQRLLL  120
ESEAGSTQAK KRMRRISADR SSDEFSEEED DDEDESEEER HARSISSRKR KADRVVLDKS  180
PRPNSGTSST SSTGLQEMLR EFFQQQQRME MQWREMMERR ARERQLFEQE WRQSMEKLER  240
ERLMVEQAWR EREEQRRLRE ESRAERRDAL LTTLLNKLIN DNNL*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
2ebi_A3e-142595777DNA binding protein GT-1
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007040932.10.0Homeodomain-like superfamily protein
TrEMBLA0A061G7N30.0A0A061G7N3_THECC; Homeodomain-like superfamily protein
STRINGGLYMA01G35370.21e-118(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM17512910
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G38250.11e-30Trihelix family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]